home *** CD-ROM | disk | FTP | other *** search
- Path: sdcc12.ucsd.edu!helix!mike
- From: mike@helix.ucsd.edu (Mike Lindelsee)
- Newsgroups: comp.databases,comp.databases.theory,comp.lang.c,comp.lang.c++
- Subject: matching strings and their plurals
- Date: 25 Mar 1996 19:15:34 GMT
- Organization: University of California, San Diego
- Message-ID: <4j6rcm$ms0@sdcc12.ucsd.edu>
- NNTP-Posting-Host: helix.ucsd.edu
-
- Hi,
-
- I'm doing some work where I'm trying to get strings and their plurals out
- of a database. Hopefully, searching on either will pull both out. The need
- here is that I won't know ahead of time if the database will have the plural
- or singular version (or both) and want to make sure I find whatever is there.
- Does anyone have any pointers for algorithms, heurisitics, or code to get me
- started?
-
- I'm thinking about using some kind of stemming algorithm and storing the
- keys in the database as just the stems of the words and searching on just the
- stem of whatever is being looked for. This sounds like something that
- someone has probably done before (and why reinvent the wheel).
-
- I'd appreciate it if responses could be emailed to me - I'll follow up with
- a summary if anyone expresses interest.
-
- Thanks for the assist!
-
- Mike
-